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^ What makes the differences: benchmarking XML database implementations 

Hongjun Lu, Jeffrey Xu Yu, Guoren Wang, Shihui Zheng, Haifeng Jiang, Ge Yu, Aoying Zhou 
February 2005 ACM Transactions on Internet Technology (TOIT), volume 5 issue i 
Publisher: ACM Press 

Full text available: ^pdf(589.14 KB) Additional Information: full citation , abstract , references , index terms 

XML is emerging as a major standard for representing data on the World Wide Web. 
Recently, many XML storage models have been proposed to manage XML data. In order 
to assess an XML database's abilities to deal with XML queries, several benchmarks have 
also been proposed, including XMark and XMach. However, no reported studies using 
those benchmarks were found that can provide users with insights on the Impacts of a 
variety of storage models on XML query performance. In this article, we report our ... 

Keywords: XML query processing, XML storage model, benchmark 



2 XML: XML screa mer: an integ rated ap pr oach to hi gh performance XML parsin g, 
val ida tio n and deserializatio n 

Margaret G. Kostoulas, Morris Matsa, Noah Mendelsohn, Eric Perkins, Abraham Helfets, 
Martha Mercaldi 

May 2006 Proceedings of the 15th international conference on World Wide Web 
WWW "06 

Publisher: ACM Press 

Full text available:^ pdff 303. 88 KB) Additional Information: full citation , abstract , references , index terms 

This paper describes an experimental system in which customized high performance XML 
parsers are prepared using parser generation and compilation techniques. Parsing is 
integrated with Schema-based validation and deserialization, and the resulting validating 
processors are shown to be as fast as or in many cases significantly faster than traditional 
nonvalldating parsers. High performance is achieved by Integration across layers of 
software that are traditionally separate, by avoiding unnecessar ... 



Keywords: JAX-RPC, SAX, XML, XML schema, parsing, performance, schema compilation, 
validation 
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Craig Anslow, Stuart Marshall, Robert Biddle, James Noble, Kirk Jackson 

January 2004 Proceedings of the 2004 Australasian symposium on Information 

Visualisation - Volume 35 APVis '04 
Publisher: Australian Computer Society. Inc. 

Full text available: g pdf(556.11 KB) Additional Information: full citation , abstract , references , index terms 
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Program traces can be used to drive visualisations of reusable components, but such 
traces can be gigabytes in size, are very expensive to generate, and are hard to extract 
information from. We have developed a solution to this problem, an XML Data Storage 
Environment (XDSE) for storing XML based program traces in a native XML database. We 
use XQuery to extract information from the program traces and the results are then 
transformed into understandable visualisations. 

Keywords: XQuery, component reuse, native XML databases, program traces, software 
visualisation 



4 Practitio ners r e port: A gile reg ressio n testing using r ecord & pla yback 
^ Gerard Meszaros 

^ October 2003 Companion of the 18th annual ACM SIGPLAN conference on Object- 
oriented programming, systems, languages, and applications OOPSLA 
•03 

Publisher: ACM Press 

Full text available: ^ pdf(267.25 KB) Additional Information: full citation , abstract , references , index terms 

There are times when it is not practical to hand-script automated tests for an existing 
system before one starts to modify it (whether to refactor it to permit automated testing 
or to add new functionality). In these circumstances, the use of "record & playbacl<" 
testing may be a viable alternative to handwriting all the tests. This paper describes 
experiences using this approach and summarizes key learnings applicable to other 
projects. 

Keywords: JUnit, XML, acceptance test, automated testing, best practices, functional 
test, patterns, playback, record, robot user, user interface 



5 XML: Sche mapath. a minimal extension to xml schema for conditional constraints 
Claudio Sacerdoti Coen, Paolo Marinelli, Fabio Vitali 

May 2004 Proceedings of the 13th international conference on Worid Wide Web 
WWW '04 

Publisher: ACM Press 

Full text available* pdf d 98.40 KB ) A^*^'*'^"^' Information: full citation , abstract , references , citings , index 

terms 

In the past few years, a number of constraint languages for XML documents has been 
proposed. They are cumulatively called schema languages or validation languages and 
they comprise, among others, DTD, XI^IL Schema, RELAX NG, Schematron, DSD, xlinkit. 
One major point of discrimination among schema languages is the support of co- 
constraints, or co-occurrence constraints, e.g., requiring that attribute A is present if and 
only If attribute B is (or is not) presentin the same element. Although ... 

Keywords: co-constraints, schema languages, schemapath, xml 




6 Tools and environments: A survey of coverage based testing tools 
Qian Yang, J. Jenny Li, David Weiss 

May 2006 Proceedings of the 2006 international worlcsliop on Automation of 
software test AST '06 

Publisher: ACM Press 

Full text available:^ pdf ( 71 .48 KB ) Additional Information: full citation , abstract , references , index terms 

Test coverage is sometimes used as a way to measure how thoroughly software is tested. 
Coverage is used by software developers and sometimes by vendors to indicate their 
confidence in the readiness of their software. This survey studies and compares 17 
coverage-based testing tools focusing on, but ndt restricted to coverage measurement. 
We also survey additional features, including program prioritization for testing, assistance 
In debugging, automatic generation of test cases, and customization ... 
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Keywords: automate test case generation, code coverage, coverage-based testing tool, 
dominator analysis, eXVantage, prioritization 



7 Visualising reusable software over the web 

Stuart Marshall, Kirk Jackson, Robert Biddle, Michael McGavin, Ewan Tempero, Matthew 
Duignan 

December 2001 Proceedings of the 2001 Asia-Pacific symposium on Information 
visualisation - Volume 9 APVis '01 

Publisher: Australian Computer Society, Inc. 

Full text available- HI pdfd 38 MB) Additional Information: full citation , abstract , references , citings, index 
u e aval a e.i2J.&.a».. tenns 

This paper describes an architecture we have developed for web-based visualisation of 
remotely executing software. The motivation for this work is to allow users of web-based 
software repositories to explore existing code components and frame-works, to see what 
they dO; and create interactive visual documentation of that code based on the 
developer's actions. This visual documentation can be used to determine what the code or 
framework does, how it does it, and whether It can be reused In ... 

Keywords: code reuse, software visualisation, web-based code repositories 



8 Test as e g eneration: Testin g software modellin g tools using data mutation 
^ Lijun Shan, Hong Zhu 

>r May 2006 Proceedings of the 2006 international worlcsliop on Automation of 
software test AST *06 

Publisher: ACM Press 

Full text available: ^pdf( 126.47 KB) Additional Information: full citation , abstract , references , index tenrts 

|v|odelling tools play a crucial role in model-driven software development methods. A 
particular difficulty in testing such software systems is the generation of adequate test 
cases because the test data are structurally complicated. This paper proposes an 
approach called data mutation to generating a large number of test data from a few seed 
test cases. It is inspired in mutation testing methods, but differs from them in the way 
that mutation operators are defined and used. In our approach, mutat ... 



9 OOPSLA demonstrations chair's welcome: Web testing made easy 




Marc Guillemot, Dierk Konig 

October 2006 Companion to the 21st ACM SIGPLAN conference on Object-oriented 



programming systems, languages, and applications OOPSLA '06 

Publisher: ACM Press 

Full text available: ^ pdf(213.33 KB) Additional Information: full citation , abstract , references , index tenns 

In this paper we describe WebTest, an Open Source tool for automated testing of web 
applications. In particular we will show how to quickly create tests that shine with 
excellent maintainability and runtime performance as well as perfect integration in the 
application development cycle. 

Keywords: automatic acceptance test, change control, test driven development, web 
application 



10 Parsing, normalizing. & storing XML: A hig h- performance interpretive a p proach to 
^ schema-directed parsin g 

^ Morris Matsa, Eric Perkins, Abraham Heifets, Margaret Gaitatzes Kostoulas, Daniel Silva, 
Noah Mendelsohn, Michelle Leger 

May 2007 Proceedings of the 16th international conference on World Wide Web 

WWW '07 
Publisher: ACM Press 



http://portalacm.org/resultsxfm?coll=ACM&dl=ACM&CFID= 1 992 1 ^ 



Res^ults (page 1): XML and test report 



Page 4. of 6 



Full text available: ^ pdf( 228.86 KB) Additional Information: full citation, abstract, references , index temis 

XML delivers key advantages in interoperability due to its flexibility, expressiveness, and 
platform-iieutrality. As XML has become a performance-critical aspect of the next 
generation of business computing infrastructure, however, It has become increasingly 
clear that XML parsing often carries a heavy performance penalty, and that current, 
widely-used parsing technologies are unable to meet the performance demands of an 
XML-based computing infrastructure. Several efforts have been made to add ... 

Keywords: XML, compiler, Interpreter, parsing, performance, schema 



11 Technical papers: testing II: A framework for component deployment testing 
Antonia Bertolino, Andrea Pollnl 

May 2003 Proceedings of the 25th International Conference on Software 

Engineering ICSE '03 
Publisher: IEEE Computer Society 

Full text available: ^p^^^ ^ ^^ Additional Information: full citation , abstract , references , citings, index 

Publisher Site tois 

Component-based development is the emerging paradigm in software production, though 
several challenges still slow down its full taking up. In particular, the "component trust 
problem" refers to how adequate guarantees and documentation about a component' s 
behaviour can be transferred from the component developer to its potential users. The 
capability to test a component when deployed within the target application environment 
can help establish the compliance of a candidate component to the cust ... 

12 Research sessions: Research 16: XML views & filtering: AFilter: adaptable XML 
filtering with prefix-cachin q suffix-clusterin g 

K. Selguk Candan, Wang-Pin Hsiung, Songting Chen, JunichI Tatemura, DIvyakant Agrawal 
September 2006 Proceedings of the 32nd international conference on Very large data 

bases - Volume 32 VLDB'2006 
Publisher: VLDB Endowment 

Full text available: ^ pdf(744.54 KB) Additional Information: full citation , abstract , references , index terms 

XML message filtering problem involves searching for instances of a given, potentially 
large, set of patterns in a continuous stream of XML messages. Since the messages arrive 
continuously, it Is essential that the filtering rate matches the data arrival rate. Therefore, 
the given set of filter patterns needs to be indexed appropriately to enable real-time 
processing of the streaming XML data. In this paper, we propose AFilter, an adaptable, 
and thus scalable, path expression filtering approach. ... 

13 Reviewed articles: SIGAda 2001 workshop, "creating a symbiotic relationship 
between XML and Ada" 
Robert C. Leif 

September 2002 ACM SIGAda Ada Letters, volume xxii issue 3 
Publisher: ACM Press 

c II * ^ I Ki^ dPi ^^f/i R/iD\ Additional Information: full citation , abstract , references , citings, index 
Full text available: Tin pdrf 1 .39 Md) 

terms 

The purpose of the workshop was to organize the Ada community to take advantage of 
the opportunity to create Ada applications that are operating systems independent 
because they are based on a web technology, XML, Extensible Markup Language. The 
commercial use of the Internet is the driving force behind XML. Four elements of XML, 
which together are sufficient to build a web application, and all employ the same syntax 
were described. These are XML; its schema; the Extensible Stylesheet Language, ... 

1 4 First European workshop on XML and knowled g e management best papers: XML 
^ and the future of humanities computing 

^ Franco NIccolucci 
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April 2002 ACM SIGAPP Applied Computing Review, volume lO issue i 
Publisher: ACM Press 
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The existence of XML induces to hope that some limits of humanities computing may soon 
be trespassed. Here we mention some arguments concerning data management and 3D 
visualization, describing a few examples and test cases where the use of XML dramatically 
improved the quality of the application. These include text encoding, archaeological data 
management and Virtual Reality reconstruction of Cultural Heritage. 

Keywords: X3D, archaeological databases, cultural virtual reconstructions, medieval 
history 



15 extended cumulated gain measures for the evaluation of content-oriented XML 
^ retrieval 

^ Gabriella Kazai, Mounia Lalmas 

October 2006 ACM Transactions on Information Systems (TOIS), volume 24 issue 4 

Publisher: ACM Press 

Full text available: ^ pdf( 3 .25 MB) Additional Infomiation: full citation , abstract , references , index terms 

We propose and evaluate a family of measures, the extended Cumulated Gain (XCG) 
measures, for the evaluation of content-oriented XML retrieval approaches. Our aim is to 
provide an evaluation framework that allows the consideration of dependency among XML 
document components. In particular, two aspects of dependency are considered: (1) 
near-misses, which are document components that are structurally related to relevant 
components, such as a neighboring paragraph or container section, and (2) over ... 

Keywords: INEX, XML retrieval, cumulated gain, dependency, evaluation, metrics, near- 
miss, overlap 



16 E-Desig n Based on the Reuse Paradi gm 

L. Ghanmi, A. Ghrab, M. Hamdoun, B. Missaoui, K. Skiba, G. Saucier 
March 2002 Proceedings of tlie conference on Design, automation and test in Europe 
DATE '02 

Publisher: IEEE Computer Society 

Full text available: ^ pdf (244.78 KB) Additional Infonmation: full citation , abstract , citin gs 

This paper gives an overview on a VIrtualelectronic component or IP (Intellectual 
Property)exchange infrastructure whose main components area XML "well structured IP e- 
catalog Builder i"and a" XML IP profilers While the first module is ae^publishlng and an 
exchange management modulethe second has as role to extract from the 
designdirectories the IP files and to trigger their transferto the user site possibly-via an IP 
distribution serverunder the catalog control. Direct Design fileextraction fro ... 

17 INEX reports: Report on the ad-hoc track of the INEX 2005 workshop 
^ Mounia Lalmas, Gabriella Kazal 

^ June 2006 ACM SIGXR Forum, volume 40 issue i 

Publisher: ACM Press 

Full text available: ^pdf(214.09 KB) Additional Information: full citation , abstract , references , index terms 

The INitiative for the Evaluation of XML retrieval (INEX) has, since 2002, been working 
towards the goal of establishing an infrastructure, in the form of a large XML test 
collection and appropriate scoring methods, for the evaluation of content-oriented XML 
retrieval systems. In 2005, 47 organizations registered to participate in INEX. Throughout 
the year a number of groups dropped out due to resource requirements, while 11 further 
groups joined. INEX 2005 concluded with a total of 41 active gr ... 
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DB-2 (databases): data streams: EXPedite: a system for encoded XML processin g 

Yi Chen, George A. Mlhaila, Susan B. Davidson, Sriram Padmanabhan 

November 2004 Proceedings of the thirteenth ACM international conference on 

Information and Icnowledge management CIKM '04 
Publisher: ACM Press 

Full text available: ^pclf(217.31 KB) Additional Information: full citation , abstract , references , index terms 

As XML becomes an Increasingly popular format for Information exchange, the efficient 
processing of broadcast XML data on a constrained device (for example, a cell phone or a 
PDA) becomes a critical task. In this paper we present the EXPedite system: a new model 
of data processing in an Information exchange environment, which "migrates" the power 
of the data-sending server to receivers for efficient processing. It consists of a simple and 
general encoding scheme for servers, and streaming que ... 

Keywords: XML, XPath, binary encoding, query processing 



19 Testing and instrumentation: Experiences in coverage testing of a Java middleware 
Mehdi Kessis, Yves Ledru, Gerard Vandome 

September 2005 Proceedings of the 5th international worlcshop on Software 
engineering and middleware SEM '05 

Publisher: ACM Press 

Full text available: - glpdn 165.99 KB) Additional Infomiation: M. citatbn, abstract, references, citings, index 

terms 

This paper addresses the issues of test coverage analysis of J2EE [22] servers. These 
middleware are nowadays at the core of the modern information technology's landscape. 
They provide enterprise applications with several non functional services such as security, 
persistence, transaction, messaging, etc. In several cases, J2EE servers play a critical role 
when applied to e-business or banking applications. Therefore, ensuring the quality of 
such software layers becomes an essential requirement. ... 

Keywords: J2EE, JOnAS, code coverage testing, large scale software development, 
middleware, software engineering 



20 XMill: an efficient compressor for XML data 
Hartmut Liefke, Dan Suclu 

May 2000 ACI^ SIGMOD Record , Proceedings of the 2000 ACI^ SIGMOD international 

conference on Management of data SIGMOD '00, volume 29 issue 2 
Publisher: ACM Press 

Full text avallable: fg|Ddf(404.39KB) Additional Infomiation: MlLcitation. abstact. feferences. dtiDgs, index 

terms 

We describe a tool for compressing XML data, with applications in data exchange and 
archiving, which usually achieves about twice the compression ratio of gzip at roughly the 
same speed. The compressor, called XMill, incorporates and combines existing 
compressors In order to apply them to heterogeneous XML data: it uses zllb, the library 
function for gzip, a collection of datatype specific compressors for simple data types, and, 
possibly, user defined compressors for application specific data ... 
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181 Semistructured Data: X007: applying 007 benchmark to XML query processing tool B 
Ying Guang Li, Stephane Bressan, Gillian Dobbie, Zoe Lacroix, Mong Li Lee, Ullas Nambiar, 
Bimlesh Wadhwa 

October 2001 Proceedings of the tenth international conference on Information and 
■knowledge management CIKM '01 

Publisher: ACM Press 

r- ^ I ui 01 A-i KAn\ Additional Infomnation: full citation , abstract, references , citings, index 
Full text available: TO pdf(1.41 MB ) - ^ 

" terms 

If XML is to play the critical role of the lingua franca for Internet data interchange that 
many predict, it is necessary to start designing and adopting benchmarks allowing the 
comparative performance analysis of the tools being developed and proposed. The 
effectiveness of existing XML query languages has been studied by many, with a focus on 
the comparison of linguistic features, implicitly reflecting the fact that most XML tools 
exist only on paper. In this paper, with a focus on efficiency a ... 



Keywords: XML aware database, XML benchmarks, XML management systems, X007, 
native-XML database 



182 Semistructured Data: Induction of inte g rated yiew for XML data with hetero g eneous ^ 
^ DIDs 

^ Euna Jeong, Chun-Nan Hsu 

October 2001 Proceedings of the tenth international conference on Information and 
icnowledge management CIKM '01 

Publisher: ACM Press 

c II* ♦ I «n ^^/oonMDx Additional Infomnation: full citation , abstract , references , citings, index 

Full text available:™ pdf(2.90 MB) ; ^ 

^ terms 

This paper proposes a novel approach to integrating heterogeneous XML DTDs. With this, 
approach, an information agent can be easily extended to integrate heterogeneous XML- 
based contents and perform federated search. Based on a tree grammar inference 
technique, this approach derives an integrated view of XML DTDs in an information 
integration framework. The derivation takes advantages of naming and structural 
similarities among DTDs in similar domains. The complete approach consists of three 
main ... 

Keywords: XML DTD, distributed databases, federated search, intelligent agent, mark-up 
schemes, semistructured data 
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Sara Comai, Ernesto Damiani, Piero Fraternal! 

October 2001 ACM Transactions on Information Systems (TOIS), volume i9 issue 4 
Publisher: ACM Press 

Full text available- ISI Ddf{707 80 KB) A^^'*'^'^^' 'nfonmation: full citation , abstract , references , citings , index 

.^4J_a : terms 

The rapid evolution of XML from a mere data exchange format to a universal syntax for 
encoding domain-specific Information raises the need for new query languages specifically 
conceived to address the characteristics of XML. Such languages should be able not only 
to extract information from XML documents, but also to apply powerful transformation 
and restructuring operators, based on a well-defined semantics. Moreover, XML queries 
should be natural to write and understand, as nontechnical person ... 

Keywords: Document restructuring, graphical query languages, semantics 



184 Posters and Short Papers: SVG for navigating digital news video 

Mlchaej G. Christel, Chang Huang 
^ October 2001 Proceedings of the ninth ACM international conference on Multimedia 
MULTIMEDIA '01 

Publisher: ACM Press 

Full text available:^ pdf( 1.68 MB ) Additional Infonmation: full citation , abstract , references , index terms 

Scalable Vector Graphics (SVG) is a language for describing two-dimensional graphics in 
XML, specifically vector graphic shapes, images, and text. SVG is a new World Wide Web 
Consortium (W3C) Candidate Recommendation as of November 2000, and this paper 
describes how SVG provides an Ideal framework for presenting manlpulable, interactive 
summarizations Into a multimedia Information repository. Specifically, we present VIBE 
and map SVG interfaces Into a digital news video library for delivery thro ... 

Keywords: SVG, digital video library, surrogate 



185 Workshop reports: Cross-Language Chinese Text Retrieval in NTCIR Workshop: 
^ towards Cross-Lan guage multilingual Text Retrieval 

^ Kuang-hua Chen, Hsin-Hsi Chen 

September 2001 ACM SIGIR Forum, volume 35 issue 2 

Publisher: ACM Press 

Full text available: ^ pdf(685.26 KB) Additional Information: full citation , abstract , references 

This article reports the results of Chinese Text Retrieval (CHTR) tasks in NTCIR Workshop 
2 and the future plan of NTCIR workshop. CHTR tasks fall into two categories: Chinese- 
Chinese IR (CHIR) and English-Chinese IR (ECIR). The definitions, schedules, test 
collection (CIRBOlO), search results, evaluation, and initial analyses of search results of 
CHIR and ECIR are discussed in this article. The new plan of NTCIR towards multilingual 
Cross-Language Information Retrieval (CLIR) is also described. 

186 Using the web for document versioning: an implementation report for Delta V 
James J. Hunt, Jurgen Reuter 

July 2001 Proceedings of the 23rd International Conference on Software 

Engineering ICSE '01 
Publisher: IEEE Connputer Society 

Full text available: 151 pdf(95.21 KB) 

S Additional Information: full citation , abstract , references , index terms 

^ Publisher Site 

The current suite of systems that offer client/server capabilities for document versioning 
relies on proprietary protocols for communicating between a central versioning repository 
and a remote client In order to support better document authoring via the Web, the 
DeltaV worl<ing group of the Web-DAV (WWW Distributed Authoring and Versioning) 
project of the Internet Engineering Task Force Is working on a standard protocol for 
versioning over HTTP, The authors present a prototype of DeltaV b ... 
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^^'^ TIGRA — an architectural style for enterprise application integration 
Wolfgang Emmerich, Ernst Ellmer, Henry Fieglein 

July 2001 Proceedings of the 23rd International Conference on Software 

Engineering ICSE '01 
Publisher: IEEE Computer Society 

Full text available: 'Q pdf(1 37.99 KB) Additional Information: full citation , abstract , references , citing s. Index 
W Publisher Site 

We report on experience that we made In the Trading room InteGRation Architecture 
project (TIGRA) at a large German bank. TIGRA developed a distributed system 
architecture for integrating different financial front-office trading systems with middle- 
and back-office applications. We generalize the experience by proposing an architectural 
style that can be re-used for similar enterprise application Integration tasks. The TIGRA 
style is based on a separation of data representation using domain-s ... 

188 Representing and querying XML with incomplete informatiori 
Serge Abiteboul, Luc Segoufin, Victor Vianu 

May 2001 Proceedings of the twentieth ACM SXGMOD-SIGACT-SIGART symposium 
on Principles of database systems PODS '01 

Publisher: ACM Press 

Full text available* fSi Ddf(226 27 KB) Additional Infonmatlon: full citation , abst ract , references , citin gs, index 
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We study the representation and querying of XML with Incomplete information. We 
consider a simple model for XML data and their DTDs, a very simple query language, and 
a representation system for incomplete information in the spirit of the representations 
systems developed by Imiellnski and Lipski for relational databases. In the scenario we 
consider, the incomplete information about an XML document is continuously enriched by 
successive queries to the document. We show that our representation ... 

Automated reasoning with legal XML documents 
Laurence L. Leff 

May 2001 Proceedings of the 8th international conference on Artificial intelligence 
and law ICAIL '01 

Publisher: ACM Press 

Full text available- 1*1 pdf(26 43 KB) Additional Infonmatlon: full citation , abstract , references , citings, index 

■ _ ^ terms 

We have integrated the Jess Expert System tool from Sandia Labs [2] with the Xerces 
XML parser. We submit to this software contracts and court filings for litigation involving 
those contracts. These are written as per a contract standard submitted to the Legal XML 
standards group [5] and the court filing proposed standards. The software determines if a 
summary judgment request can be granted based on the submitted affidavits, contracts, 
and other documents. 



''^O Personalizin g E-commerce applications with on-line heuristic decision making 
Vinod Anupam, Richard Hull, Bharat Kumar 

April 2001 Proceedings of the 10th international conference on World Wide Web 
WWW '01 

Publisher: ACM Press 

Full text available: ^ pdf(261.12 KB) Additional Information: full citation , references , citin gs, index terms 
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191 Multidocument summarization via information extraction 

Michael White, Tanya Korelsky, Claire Cardie, Vincent Ng, David Pierce, Kiri Wagstaff 
March 2001 Proceedings of the first international conference on Human language 

teclinology research HLT '01 
Publisher: Association for Computational Linguistics 

Full text available: ^ pdf(72.44 KB) Additional Information: full citation , abstract , references , citings 

We present and evaluate the initial version of RIPTIDES, a systenn that combines 
Information extraction, extraction-based summarization, and natural language'generation 
to support user-directed multidocument summarization. 

192 Component selection and matching for IP-based design 
G. Martin, R. Seepold, T. Zhang, L Benini, G. De Micheli 

March 2001 Proceedings of the conference on Design, automation and test in Europe 
DATE '01 

Publisher: IEEE Press 
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193 XSLT for tailored access to a digtal video library 
Michael G. Christel, Bryan Maher, Andrew Begun 

January 2001 Proceedings of the 1st ACM/IEEE-CS joint conference on Digital 
libraries JCDL '01 

Publisher: ACM Press 



Full text available: Ddf(892.07 KB) 
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Surrogates, summaries, and visualizations have been developed and eval uated for 
accessing a digital video library containing thousands of documents and terabytes of data. 
These interfaces, formerly implemented within a monolithic stand-alone application, are 
being migrated to XML and XSLT for delivery through web browsers. The merits of these 
interfaces are presented, along with a discussion of the benefits in using W3C 
recommendations such as XML and XSLT for delivering tailored access to ... 

Keywords: XML, XSLT, digital video library, surrogate 



1 94 Prototy pe for wrapping and visual izing geo-referenced data in a distributed 
environment usin g XML technology 

Jianting Zhang, Muhammad Javed, Amir Shaheen, Le Gruenwald 

November 2000 Proceedings of the 8th ACM international symposium on Advances in 
geographic information systems GIS '00 

Publisher: ACM Press 

Full text available: ^pdf(618.21 KB) Additional Infonmation: full citation , abstract , citings , index temris 

This paper proposes a prototype for integration and visualization of geo-referenced 
Information (GRI) in a distributed environment in general and World Wide Web in 
particular. This prototype adopts a three-tier architecture and Includes three main 
components: GRI wrapper for distributed GRI web sites, GRI integration mediator and 
client side visualization interface. 

In this prototype, XML is used as a communication protocol between distributed web sites 
that provide GRI and the mediat ... 

Keywords: XML, geo-referenced information. Integration, visualization 
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September 2000 ACM SIGPLAN Notices /Proceedings of tlie fifth ACM SIGPLAN 

international conference on Functional programming ICFP '00, volume 
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Publisher: ACM Press 

Full text available- IB Ddfl5752QKB) Additional Information: full citation , abstract , references , citings, index 

' tenms 

We propose regular expression types as a foundation for XML processing languages. 
Regular expression types are a natural generalization of Document Type Definitions 
(DTDs), describing structures in XML documents using regular expression operators (i.e., 
*, ?, I, etc.) and supporting a simple but powerful notion of subtyping. The decision 
problem for the subtype relation is EXPTIME-hard, but it can be checked quite efficiently 
in many cases of practical interest. The subtyping algori ... 

196 j Rapture: A Capture/Replay tool for observation-based testin g B 
^ John Steven, Pravir Chandra, Bob Fleck, Andy Podgurski 

^ August 2000 ACM SIGSOFT Software Engineering Notes , Proceedings of the 2000 

ACM SIGSOFT international symposium on Software testing and analysis 

ISSTA '00, Volume 25 Issue 5 
Publisher: ACIVI Press 

Full text available: Q pdf(403.58 KB) Additional Infonmation: f uli citation , abstra ct, r eferences , dflngs. index 

We describe the design of jRapture: a tool for capturing and replaying Java program 
executions in the field. jRapture works with Java binaries (byte code) and any compliant 
Implementation of the Java virtual machine. It employs a lightweight, transparent capture 
process that permits unobtrusive capture of a Java programs executions. jRapture 
captures interactions between a Java program and the system, including GUI, file, and 
console inputs, among other types, and on replay it presents eac ... 

Keywords: Java, capture/replay, execution profiling, observation-based testing, software 
testing 



197 Interactive mathematics via the Web using MathML 
^ Francis J. Wright 

>/ June 2000 ACM SIGSAM Bulletin, volume 34 Issue 2 
Publisher: ACM Press 

Full text available: Qpdf (1.07 MB ) Additional Infonmation: full citation , abstract , index terms 

MathML is a mathematical markup language intended for displaying mathematics in web 
browsers. At present, It can be used to display mathematics generated dynamically in 
response to interactive queries only if the browsing and generating facilities are chosen 
carefully. This paper examines the background and possible options, and describes some 
of the details of the use of MathML to display the output from a web-based demonstration 
of an ordinary differential equation solver running in REDUCE ... 

198 XMill: an efficient compressor for XML data 
Hartmut Liefke, Dan Suciu 

May 2000 ACM SIGMOD Record , Proceedings of the 2000 ACM SIGMOD international 

conference on Management of data SIGMOD '00, volume 29 issue 2 
Publisher: ACM Press 

Full text available- 151 pdf(404 39 KB) Additional Infonmation: full citation , abstract , references , citings, index 
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We describe a tool for compressing XML data, with applications in data exchange and 
archiving, which usually achieves about twice the compression ratio of gzip at roughly the 
same speed. The compressor, called XMIII, incorporates and combines existing 
compressors in order to apply them to heterogeneous XML data: it uses zlib, the library 
function for gzip, a collection of datatype specific compressors for simple data types, and. 
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possibly, user defined compressors for appiication specific data ... 
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C. D. Mote 

May 2000 Proceedings of the 2000 annual national conference on Digital 
government research dg.o '00 

Publisher: Digital Government Research Center 

Full text available: Q pdf( 539.99 KB) Additional Information: full citation , abstract 

As use of the Internet in commerce, education and personal communication has become 
common, the question of Internet voting in local and national elections naturally arises. In 
addition to adding convenience and precision, some believe that Internet voting may 
reverse the historical and downward trend of voter turnout in the United States. For these 
reasons President Clinton issued a memorandum in December 1999 requesting that the 
National Science Foundation examine the feasibility of online (In ... 
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